(54) Mobile data terminals with text-to-speech capability



(57) A mobile data terminal mountable on a mobile platform for transmitting, receiving and converting text
data strings into audible signals. The mobile data terminal comprises a radio modem (7) for receiving text
data strings through a communication network and a memory (9) for storing the received data strings. The
text data strings are associated with pre-stored speech segments for generating audible signals representative
of the speech segments.
Description

FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to the field of mobile data terminals.

Mobile services such as, for example, trucking and courier services are required to maintain some form of
communication channel with their dispatcher in order to receive instructions, on the one hand, and to update the dispatcher
as to their location and service activities, on the other. The data transferred, in both directions, is referred to as mobile
data and the mobile units that transmit and receive the mobile data are known as Mobile Data Terminals (MDT's).

Generally, text data received by an MDT from a dispatcher is displayed on a screen just as in a paging device. The
displayed message is replied to by depressing a suitable reply key on a keyboard. Each key corresponds to a standard
message for responding to the displayed message. Conventional MDTs are available from such companies as Glanair;
Racotec; Ericsson, NJ; Coded Communications, Los Angeles, CA; Motorola, Chicago, IL.

The received text data displayed on an MDT has to be read by the user, which is dangerous if he/she is driving at
the time. In fact, trying to read whilst driving has resulted in fatalities. Clearly then, there is a necessity for providing
the mobile user with the option of hearing the received text data. The process wherein text data is converted to an
audio signal representative of a human voice is known as "text-to-speech" conversion. Text-to-speech conversion is
known in the art and has been applied to stationary equipment (see, e.g., "Best Speech™ T-T-S", copyright 1991,
Berkeley Speech Technologies Inc., U.S.A.) using a synthesized voice. However, there are no known applications of
text-to-speech technologies to mobile systems.

There is accordingly a long felt need to incorporate a text-to-speech capability in conventional mobile data terminals.

SUMMARY OF THE INVENTION 


It is an object of the present invention to provide a Mobile Data Terminal that displays a received message visually 
and additionally converts it to an audible signal in the form of a human voice (speech) or a synthesized voice. 

In accordance with the first aspect of the present invention there is provided a mobile data terminal for receiving 
and converting transmission data indicative of text data strings into an audible signal comprising: 


a receiver for receiving said transmission data through a communication network; 
a first memory for storing said received transmission data; 

a processor for obtaining from said stored transmission data speech segments corresponding to said text data 
strings; and 

an audio generator for generating audible signals representative of said speech segments.

Generally, said transmission data includes data indicative of words. 

If desired, said transmission data includes data indicative of speech segments. 

Further if desired, said speech segments are phonemes.

The transmission data is indicative of text data strings which in general include text and/or speech segment
information. However, this is by no means binding and it is conceivable that the transmission data also contain other data
representative of text data strings such as, for example, text segments.

If desired, said transmission data includes data indicative of text segments and said data indicative of text segments 
are associated with at least one of syllables and words. 
Preferably, said communication network is associated with at least one of Cellular Digital Packet Data (CDPD),
satellite, Mobitex data, Ardis data, Specialized Mobile Radio, GSM, PCS.

If desired, said communication network is a wired network. 

In the following description and claims, reference is made to the terms data compression and data coding. The
term data compression is understood to refer to all content insensitive techniques whereby a given stream of data,
independent of its contents, is bitwise compressed. The term data coding is understood to refer to all content sensitive
techniques whereby a given stream of data is compressed with reference to specific qualities of the data contents. A
nonbinding example of data compression is the Ziv-Lempel technique. A nonbinding example of data coding of a data
stream containing text data (e.g., words, speech segments, text segments, or any combination thereof) would be to
associate with each of the contents of the data stream, or a combination thereof, a code (e.g., a number). Data coding
employs a look up table whose contents are text data and their associated codes. It will be appreciated that the
procedure of coding also covers the possibility of encryption of the text data.

Preferably, said transmission data is coded. If the transmission data is coded, the mobile data terminal further
includes a decoder, and the coded transmission data is decoded in the mobile data terminal.
Generally, said words are classified to at least two groups, a first group including a priori known words, constituting
first group words, and a second group including new words, constituting second group words, and wherein the
transmission data with respect to each one of the first group words is a code representative of speech segments that
correspond to a complete first group word, and the transmission data with respect to each one of the second group words
is at least one code representative of a respective at least one speech segment that corresponds to at least a portion of
said second group word.

If desired, speech segments that correspond to a first group word are extracted from said coded transmission data
and said mobile data terminal further includes a feeder for feeding said extracted speech segments to said audio
generator for generating audio signals representative of said speech segments.

Alternatively, speech segments that correspond to a second group word are extracted from said coded transmission
data and said mobile data terminal further includes a feeder for feeding said extracted speech segments to said audio
generator for generating audio signals representative of said speech segments.

Preferably, said audible signals are associated with at least one of synthesized speech signals, speech signals. 

Coding and compression can be used advantageously in combination. A text data string can be reduced in bit
length by coding it. The coded text data string can then further be reduced in length by compressing it.

If desired, said transmission data is compressed and the mobile data terminal further comprises a decompressor 
for decompressing the compressed transmission data. 

Optionally, the mobile data terminal further comprises: 

a transmitter for transmitting transmission data;

a coder for coding transmission data; and 
a compressor for compressing transmission data. 

In accordance with a second aspect of the present invention, there is provided a method for transmitting, receiving
and converting transmission data indicative of text data strings originated by or received from a first device, into audible
signals indicative of said text data strings in at least one second device; either or both of said first device and the at
least one second device being a portable wireless communication device mountable on a mobile platform; at least one
of said first device and said second device includes a table of speech segments corresponding each to a word or a
portion thereof; the method comprising the following steps:


(i) reducing said text data strings to words; 

(ii) associating with said words corresponding speech segments; 

(iii) transmitting through a communication network from said first device to the at least one second device said 
transmission data; 

(iv) receiving, through said communication network, in the or each second device said transmission data;

(v) generating in said at least one second device audio signals representative of said speech segments. 

If desired, said step (i) is executed in said first device. 
Further if desired, said step (ii) is executed in said first device. 
Alternatively, said step (ii) is executed in said at least one second device.

Optionally, said steps (i) and (ii) are executed in said at least one second device. 

It should be appreciated that either of the steps (i) and (ii) can be carried out partially in the first device and partially 
in the at least one second device. 

If desired, said transmission data includes data indicative of words. 
Also if desired, said transmission data includes data indicative of speech segments.

Further if desired, said transmission data includes data indicative of text segments and said data indicative of text 
segments are associated with at least one of syllables and words. 

Preferably, said communication network is associated with at least one of Cellular Digital Packet Data (CDPD), 
satellite, Mobitex data, Ardis data, Specialized Mobile Radio, GSM, PCS. 
If desired, said communication network is a wired network.

Preferably, said transmission data is coded, wherein said coded transmission data is decoded in said at least one 
second device. 

Preferably, said transmission data is also compressed. 

Generally, said words are classified to at least two groups, a first group including a priori known words, constituting
first group words, and a second group including new words, constituting second group words, and wherein the
transmission data with respect to each one of the first group words is a code representative of speech segments that
correspond to a complete first group word, and the transmission data with respect to each one of the second group words
is at least one code representative of a respective at least one speech segment that corresponds to at least a portion
of said second group word.

If desired, said audible signals are associated with at least one of synthesized speech signals, speech signals. 
If desired, said speech segments are phonemes. 

In some circumstances it may be convenient that the transmission data wholly comprises words, whereas in others
it may comprise speech segments. On the other hand it is often desirable that the transmission data comprise both
words and speech segments. The latter case is especially true of frequently used words. To reduce air time costs it is
preferable to code the messages by coding both the word and its speech segments in one code. On the other hand,
for words that are not frequently used it is advantageous to code messages by coding speech segments but not the
words. Since coding and decoding entail the storage of look-up tables, the number of codes that can be stored will be
determined by memory constraints.

It should be noted that in the present invention transmission data is preferably wirelessly transmitted and received
but can also be transmitted and received by wire. The final product is an audio signal indicative of the transmission
data. This should be compared with a similar scenario in which the same text is transmitted as an audio signal, typically
in coded form. Transmitting a message as an audio signal requires a much larger bandwidth than transmitting the same
message as an equivalent text string or transmission data incorporating the text string and its associated speech
segments. Hence, in accordance with the present invention a much smaller bandwidth is required to transmit a given
message than would be required when using conventional audio transmission techniques for the same message.
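
The order of magnitude of this bandwidth difference can be illustrated with a rough calculation. The figures below are assumptions chosen only for illustration (uncompressed 8-bit audio sampled at 8 kHz and a spoken duration of about two seconds); they are not taken from this description.

```python
def text_bytes(message: str) -> int:
    """Bytes needed to send the message as plain ASCII text."""
    return len(message.encode("ascii"))


def audio_bytes(duration_s: float, sample_rate_hz: int = 8000,
                bytes_per_sample: int = 1) -> int:
    """Bytes needed to send the same message as uncompressed audio.

    The default sample rate and sample width are illustrative assumptions.
    """
    return int(duration_s * sample_rate_hz * bytes_per_sample)


message = "YOUR BUDDY ABDULLAH"
as_text = text_bytes(message)    # one byte per character, spaces included
as_audio = audio_bytes(2.0)      # assumed two seconds of speech
ratio = as_audio / as_text       # how many times larger the audio form is
```

Even if the audio were coded at a much lower rate, the text form would be expected to remain far smaller under these assumptions.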

In view of the ever growing demand for the allocation of frequency bands for communications and the resulting
overcrowding of the usable spectrum, it is readily appreciated that the present invention presents an effective way of
reducing the demand for wide frequency bands. Furthermore, the compression of the text data and/or speech segments
considerably reduces air time costs.

It should be noted that, for convenience and simplicity of illustration, the order in which the steps of the method of
the invention are carried out is not necessarily as specified.

BRIEF DESCRIPTION OF THE DRAWINGS

For a better understanding the invention will now be described, by way of example only, with reference to the
accompanying drawings and appendix in which:

Fig. 1 Illustrates a typical scenario for transmitting data from a dispatcher to a mobile data terminal;

Fig. 2 Shows a schematic block diagram of the structure of a typical Mobile Data Terminal;

Fig. 3 Shows a generalized block diagram of the text-to-speech procedure;

Fig. 4 Illustrates a typical scenario for transmitting data from a mobile data terminal to a dispatcher;

Fig. 5 Illustrates a typical scenario for transmitting data from one mobile data terminal to another mobile data
terminal;

Fig. 6 Shows a schematic block diagram of the main steps involved in transmitting a coded message string; and

Fig. 7 Shows a schematic block diagram of the main steps involved in receiving and displaying a coded message
string.

DETAILED DESCRIPTION OF THE INVENTION

Attention is first drawn to Fig. 1 illustrating a typical scenario for transmitting data from a dispatcher to a mobile
data terminal. A dispatcher 1 prepares a message, i.e. transmission data, in the form of text data, which is conveyed
via an X.25, a leased line or a radio link 2 to a communication network 3, from where it is broadcast. The broadcast
text data (i.e., transmission data) 4 is received by a Mobile Data Terminal 5. The transmission data 4 is indicative of
text data strings which in general include words, syllables, speech segments, text segments or any combination thereof.

Communication network 3 is preferably a wireless network such as one of the following: Cellular Digital Packet
Data (CDPD), satellite, Mobitex data, Ardis data, Specialized Mobile Radio, GSM, PCS. However, a wired
communication network such as a telephone system can also be used by connecting the MDT 5 to a telephone outlet socket.

Attention is now drawn to Fig. 2 showing a schematic block diagram of the structure of a typical MDT 5. A
micro-controller 6 (e.g. a microprocessor of the 80C51 family, commercially available from, e.g., Motorola or Intel) controls
all the peripheral devices: the radio modem 7, which serves as a receiver for the transmission data, the Read Only
Memory (ROM) 8, the Random Access Memory (RAM) 9, the audio generator 10 consisting of a digital-to-analog circuit
and an audio amplifier, the Liquid Crystal Display (LCD) 11 and the keyboard 12. Also shown are an antenna 14 and a
loudspeaker 15. The controller 6 is connected to the peripheral devices by a bus 13.

The text-to-speech procedure will now be described with reference to Fig. 3 showing a generalized block diagram
of the procedure, and occasionally also to Fig. 2. A text string received by the antenna 14 is transferred to, and stored
in, the memory 8. Each sentence is broken down into words 16, by employing the space between the words as a criterion
for defining a word in a sentence. In the next stage, 17, the words are parsed into text segments by known parsing
techniques. An example of a text-to-speech table for converting the text segments to speech segments is given in the
appendix. The text-to-speech conversion table is searched for text segments that match the left hand side of the table.
Speech segments corresponding to these text segments appear on the right hand side enclosed in quotes. For example,
consider the word EXCELLENT. The text segments matching this word, and the corresponding speech segments are:



E = //E// = 'EH'
X = //X// = 'K-S'
C = //C/+/ = 'S'
E = //E// = 'EH'
L = //L// = 'L'
L = /L/L// = ''
E = //E// = 'EH'
N = //N// = 'N'
T = //T// = 'T'



The operations of breaking down, or separation, of each sentence in the received text string, into words, the parsing 
of the words into text segments and the obtaining of the associated speech segments from the text-to-speech table is 
performed by the micro-controller (processor) 6. 
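
The word separation and table search described above can be sketched as follows. This is a minimal sketch under stated assumptions: the rule list is a hypothetical subset sufficient for the word EXCELLENT, only the "front vowel" right-hand context (written "+") and a literal left-hand context are supported, and the function names are illustrative rather than part of the described terminal.

```python
FRONT_VOWELS = set("EIY")

# Each rule: (left context, matched text, right context, speech segment).
# An empty context means "no constraint"; "+" means the next letter must be
# a front vowel (E, I or Y). Rules are tried in order, first match wins.
RULES = [
    ("",  "X", "",  "K-S"),
    ("",  "C", "+", "S"),
    ("",  "C", "",  "K"),
    ("L", "L", "",  ""),    # second L of a double L yields no speech segment
    ("",  "L", "",  "L"),
    ("",  "E", "",  "EH"),
    ("",  "N", "",  "N"),
    ("",  "T", "",  "T"),
]


def split_words(sentence):
    """Break a sentence into words, using the space as the separator."""
    return sentence.split()


def rule_matches(word, pos, rule):
    left, match, right, _ = rule
    if not word.startswith(match, pos):
        return False
    if left and not word[:pos].endswith(left):
        return False
    following = word[pos + len(match): pos + len(match) + 1]
    if right == "+":
        return following in FRONT_VOWELS
    return True


def to_speech_segments(word):
    """Parse a word left to right, emitting one speech segment per matched rule."""
    word = word.upper()
    segments, pos = [], 0
    while pos < len(word):
        for rule in RULES:
            if rule_matches(word, pos, rule):
                segments.append(rule[3])
                pos += len(rule[1])
                break
        else:
            raise ValueError(f"no rule for {word[pos]!r} in {word!r}")
    return segments


segments = to_speech_segments("EXCELLENT")
```

A complete rule set, such as the one in the appendix, would need the remaining context symbols and one rule group per letter; the first-match-wins ordering is why more specific rules are listed before general ones.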

In order to establish where to stress the word, a possible, but by no means exhaustive, rule would be to parse the
word as follows: EX - CEL - LENT. According to this rule the third syllable from the end is stressed when pronouncing
the word. In the next stage, each speech segment is associated, 18, with a prerecorded voice segment 19 located in
the memory 8. The associated prerecorded voice segments are fed successively by a feeder, incorporated in the
micro-controller 6, to the audio generator (10 in Fig. 2 and 20 in Fig. 3) and henceforth to the loudspeaker 15 to give rise to
a continuous voice signal representative of the original received text string. The operator of the MDT may choose
between a visual display of the message on the LCD 11, or an audio rendering of the message via the loudspeaker
15, or both, by the appropriate depression of keys on the keyboard 12.
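
The feeding of prerecorded voice segments to the audio generator can be pictured as a simple concatenation of stored samples. The sketch below is an assumption-laden illustration: the sample values are made-up placeholders for 8-bit PCM data, and the names VOICE_SEGMENTS and render are hypothetical.

```python
# Hypothetical store of prerecorded voice segments (placeholder 8-bit samples),
# keyed by speech segment, as might be held in read-only memory.
VOICE_SEGMENTS = {
    "EH": bytes([128, 150, 170, 150]),
    "N": bytes([128, 120, 110]),
    "T": bytes([128, 200]),
}


def render(speech_segments):
    """Concatenate the stored samples of each speech segment in order.

    Empty segments (silent letters) contribute no samples.
    """
    return b"".join(VOICE_SEGMENTS[s] for s in speech_segments if s)


samples = render(["EH", "", "N", "T"])
```

The resulting sample stream would then be passed to the digital-to-analog circuit and amplifier.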

The application of text-to-speech conversion as applied to an MDT is not limited to the transmission of simple text
files from the dispatcher to the mobile worker but can clearly be applied to an Internet connection wherein electronic
mail is sent to the mobile worker.

The text-to-speech procedure as described with reference to Fig. 3 is a general procedure and is by no means 
restricted to a message sent from a dispatcher to an MDT as shown in Fig. 1 but can be also applied to a message 
sent from an MDT to a dispatcher, as shown in Fig. 4, or from one MDT to another as shown in Fig. 5. 

In practice the transmission, or broadcasting, of a text string can be costly, especially when a large number of
MDT's (say several thousand) is involved. Since messages sent between dispatchers and MDT's and between MDT's
use, in general, a well defined vocabulary, one possible way of reducing communication expenses would be to code
the words in such a way that the coded message is shorter than the original text message. This can be attained by
ensuring that the code for each word has a smaller bit length than the total bit length occupied by the word. Coding and
ensuing decoding of transmission data is performed by a coder and decoder, respectively, which are incorporated in
micro-controller 6 (Fig. 2).

Coding then reduces the number of bytes of information to be transmitted for a given message, thereby reducing 
air time costs which are a function of the number of bytes transmitted. 

In accordance with a preferred embodiment the coding is based upon classifying words into three categories:
frequently used words, infrequently used words and words that do not fall into the first two categories. For convenience
three tables of coding lists are defined. List A contains frequently used words together with their speech segments
and code. List B contains infrequently used words and their speech segments but no code, and list C contains a list
of speech segments and codes, for constructing words that belong neither to list A nor to list B. It should be noted that in
all cases the term "speech segment" should be understood to include the case of phonemes. Tables 1, 2 and 3 illustrate
examples of lists A, B and C respectively.

In accordance with this embodiment each code is represented by two bytes of information. A word belonging to
list A is given a "classification bit" having a value of 1 which is added to the most significant bit (MSB) of the code.
Since two bytes contain 16 bits and the classification bit takes up 1 bit, a code has only 15 bits free for code information.
This means that list A can contain a total of 2^15 = 32 K codes representing 32 K words.

A word that does not belong to list A is given a classification bit having a value of zero and therefore its code's
MSB remains unchanged.
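
The classification bit can be illustrated with a few lines of bit arithmetic. This is a sketch only; the function names are hypothetical, and the example code value 0DAC (hex) is the one used for the word YOUR later in this description.

```python
CLASSIFICATION_BIT = 0x8000   # MSB of a two-byte (16-bit) code
CODE_MASK = 0x7FFF            # the remaining 15 bits carry the code itself


def encode_list_a(code: int) -> bytes:
    """Set the classification bit on a 15-bit list A code; return two bytes."""
    if not 0 <= code <= CODE_MASK:
        raise ValueError("list A codes must fit in 15 bits")
    return (code | CLASSIFICATION_BIT).to_bytes(2, "big")


def is_list_a(first_byte: int) -> bool:
    """An entry belongs to list A when the MSB of its first byte is 1."""
    return bool(first_byte & 0x80)


def decode_list_a(two_bytes: bytes) -> int:
    """Strip the classification bit to recover the 15-bit code."""
    return int.from_bytes(two_bytes, "big") & CODE_MASK


sent = encode_list_a(0x0DAC)      # two bytes: 8D, AC
capacity = 2 ** 15                # number of distinct list A codes
```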

The method for transmitting a coded message, or partially coded message, is illustrated in Fig. 6. The message
is entered through a keyboard (step 30). The message is broken down into separate words (step 32) and each word
in the message is checked to determine whether it belongs to list A (step 34). If the word does belong to list A, then
the code representing the word is retrieved from list A and a 1 is added to the code's MSB (step 36).

If the word does not belong to list A, then it is checked to see if it belongs to list B (step 38). If the word does belong
to list B then a string of letters is prepared and the word's speech segments are retrieved from list B (step 40). If the
word does not belong to list B, then a text-to-speech segment procedure is applied to the word (step 42). One possible
procedure for converting words to speech segments is that described with reference to Fig. 3, step 17. A string of
letters of the word is now prepared together with the speech segments' codes, which are retrieved from list C (step 44).
Whether the words belong to list B or not, the number of letters in the word and the number of speech segments is
registered (step 46). Finally, together with coded words that belong to list A, a complete coded string (or partially coded
string) of the message is constructed and transmitted (step 48).
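
The transmission steps of Fig. 6 can be sketched in a few lines. The sketch below makes several assumptions that should be read as illustrative only: the lists hold just the entries needed for the example, the speech segment labels "b", "u", "d", "e" are simplified placeholders, and the text_to_segments argument is a hypothetical hook standing in for the text-to-speech segment procedure of step 42. The code values follow the worked BUDDY and YOUR examples of this description.

```python
LIST_A = {"YOUR": 0x0DAC}                       # word -> 15-bit code
LIST_B = {"BUDDY": ["b", "u", "d", "e"]}        # word -> speech segments
LIST_C = {"b": 0x000A, "u": 0x0358, "d": 0x0014, "e": 0x0032}  # segment -> code


def encode_word(word, text_to_segments=None):
    """Encode one word following steps 34 to 46."""
    if word in LIST_A:                                   # steps 34, 36
        return (LIST_A[word] | 0x8000).to_bytes(2, "big")
    segments = LIST_B.get(word)                          # steps 38, 40
    if segments is None:                                 # step 42
        if text_to_segments is None:
            raise ValueError(f"no speech segments known for {word!r}")
        segments = text_to_segments(word)
    # Step 46: letter count and segment count, then the letters themselves.
    out = bytes([len(word), len(segments)]) + word.encode("ascii")
    for segment in segments:                             # step 44
        out += LIST_C[segment].to_bytes(2, "big")
    return out


def encode_message(message, text_to_segments=None):
    """Steps 32 and 48: split into words and concatenate the encoded words."""
    return b"".join(encode_word(w, text_to_segments) for w in message.split())


encoded = encode_message("YOUR BUDDY")
```

Note that no separator is emitted between words: each entry is either exactly two bytes (MSB set) or is self-delimiting through its two leading count bytes.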

The method for receiving a coded, or partially coded, message and converting it to text or audio form is illustrated
in Fig. 7. The string is received (step 50) and a check is carried out for entries in the string belonging to list A (step
52). If the MSB of an entry being checked is equal to 1, then the entry is coded and belongs to list A and the code for
the word belonging to list A is extracted from the string (step 54). The letters and speech segments of the word
corresponding to the extracted code are also retrieved from list A (step 56).

If the MSB of an entry being checked is not equal to 1, then the letters and speech segment codes for that entry
are retrieved from the received string (step 58).

In the following, reference is made both to Fig. 2 and Fig. 7. A complete message is formed in RAM 9 from the
retrieved letters and speech segments (step 60). The message takes on two forms. One is in text form for displaying
on the LCD 11 and the other is in speech segment form for conversion to an audio signal for an audio rendering of the
message via loudspeaker 15. By appropriate depression of keys on the keyboard 12 the text message can be visually
displayed (step 62), or audio conversion software (18, 19 in Fig. 3) is applied to the speech segments to obtain voice
samples (step 64) which are transferred to a digital-to-analog circuit and amplifier (10, 15 in Fig. 2 and step 66 in Fig.
7). If desired, both a visual and an audio display can be requested.
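
The receiving steps of Fig. 7 can be sketched as the mirror image of the transmission procedure. As before, this is a hedged illustration: the list contents are a minimal subset, the speech segment labels (including those given for YOUR) are simplified placeholders, and the function name decode_message is hypothetical.

```python
LIST_A = {0x0DAC: ("YOUR", ["y", "or"])}        # code -> (word, speech segments)
LIST_C = {0x000A: "b", 0x0358: "u", 0x0014: "d", 0x0032: "e"}  # code -> segment


def decode_message(data: bytes):
    """Return (text, segments per word) for a coded or partially coded string."""
    words, segments, pos = [], [], 0
    while pos < len(data):
        if data[pos] & 0x80:                             # steps 52 to 56
            code = int.from_bytes(data[pos:pos + 2], "big") & 0x7FFF
            word, word_segments = LIST_A[code]
            pos += 2
        else:                                            # step 58
            n_letters, n_segments = data[pos], data[pos + 1]
            pos += 2
            word = data[pos:pos + n_letters].decode("ascii")
            pos += n_letters
            word_segments = [
                LIST_C[int.from_bytes(data[pos + 2 * i:pos + 2 * i + 2], "big")]
                for i in range(n_segments)
            ]
            pos += 2 * n_segments
        words.append(word)
        segments.append(word_segments)
    # Step 60: the text form is rebuilt by re-inserting spaces between words.
    return " ".join(words), segments


text, speech = decode_message(bytes.fromhex("8dac05044255444459000a035800140032"))
```

The text form would go to the display and the speech segment form to the audio conversion stage. The scheme relies on the letter count of an uncoded word being below 128, so that its first byte never has the MSB set.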

To exemplify the use of lists A, B and C, consider the following sentence:

YOUR BUDDY ABDULLAH.

It should be noted that for the sake of illustration a Hexadecimal base is used for the codes.

The word YOUR belongs to list A and is therefore represented by the Hex code 0DAC. However, since it belongs
to list A, a 1 has to be added to its MSB, hence it will be transmitted as 8D,AC where the comma differentiates between
consecutive bytes.

The word BUDDY belongs to list B and would be transmitted as follows:

05,04,66,85,68,68,89,00,0A,03,58,00,14,00,32

where the first byte 05 is the number of letters in the word and the second byte 04 is the number of speech segments
in the word. The five numbers that follow the first two bytes, that is 66,85,68,68,89, are the ASCII values of the letters
B, U, D, D, Y respectively. The remaining eight bytes are the Hexadecimal representation of the coded values of the
speech segments of the word BUDDY and are taken directly from list B.

The word ABDULLAH belongs neither to list A nor to list B and would be transmitted as follows:

08,05,65,66,68,85,76,76,65,72,00,03,00,0A,00,14,01,F5

where the first byte 08 is the number of letters in the word and the second byte 05 is the number of speech segments
in the word. The eight numbers following the first two bytes are the ASCII values of the letters A, B, D, U, L, L, A, H
respectively. The remaining eight bytes are the Hexadecimal representation of the coded values of the speech
segments of the word ABDULLAH. Since the word ABDULLAH belongs neither to list A nor to list B, a text-to-speech
segment conversion procedure has to be applied to the word in order to obtain the speech segments. The codes for
these speech segments are obtained from list C. It should be noted that, since neither the word BUDDY nor the word
ABDULLAH belongs to list A, in neither of these cases is a 1 added to the MSB of the first byte of the string of
bytes representing the word.

The three lists A, B and C are stored by all communicating units, i.e. both by the MDT's and by the dispatcher. It
should be noted that words belonging to list A are fully coded, in the sense that the word together with its speech
segments are represented by a code of two bytes, no matter how long the word is. Words not belonging to list A are
partially coded in the sense that although the letters of the words are transmitted in full, the speech segments are
coded, each speech segment being represented by two bytes. For words not belonging to list A, the difference between
those words that belong to list B and those that do not, is that for words belonging to list B the speech segments are
known, whereas for words that do not belong to list B the speech segments of the words are not known and a
text-to-speech segment procedure has to be applied to them to obtain the speech segments. In all cases, a further saving is
noted in that the space between words does not have to be recorded in the string to be transmitted.

As well as being coded, or as an alternative to being coded, the transmission data can also be compressed using,
e.g., the Ziv-Lempel technique. Compressing and ensuing decompressing of transmission data is performed by a
compressor and decompressor, respectively, which are incorporated in micro-controller 6 (Fig. 2).

The reason for not including all words in list A is simply due to memory limitations, hence list C is always required.
List B is useful for saving the computational time required by the text-to-speech segment procedure. In general, the
contents of lists A and B are not rigid: a periodic review of the statistics of the usage of words is carried out and the
lists are appropriately updated.
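
One conceivable way of carrying out such a periodic review is to count word occurrences in a log of past messages and keep the most frequent words in list A. The sketch below is purely illustrative: the message log, the capacity value and the function name select_list_a are assumptions, not part of this description.

```python
from collections import Counter


def select_list_a(messages, capacity):
    """Return the `capacity` most frequently used words in the message log."""
    counts = Counter(
        word for message in messages for word in message.upper().split()
    )
    return [word for word, _ in counts.most_common(capacity)]


log = ["CALL THE OFFICE", "CALL THE DEPOT", "CALL BUDDY"]
frequent = select_list_a(log, capacity=2)
```

Since all communicating units store the same lists, any such update would have to be distributed to every unit before the new codes are used.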
Although in the foregoing description the use of the English language was exemplified for text-to-speech conversion
as applied to an MDT, it is clear that the description applies equally well to other languages by using a suitable set of
rules for transforming parsed words and syllables of that language into their phonetic representation.

It will be appreciated that although the conversion of text data to an audio signal representative of synthesized
speech or of human speech has been illustrated with respect to mobile data terminals, it is by no means restricted to
the latter and can equally well be applied to any type of mobile computer, from a small two way pager device with a
proprietary operating system via a PDA type terminal to a full electronic notebook, laptop computer and the like.



TABLE 1

WORD     SPEECH SEGMENT
A        a-
...
BUDDY    b u d//e-
...
ZOO      zooNI



TABLE 2

HEX CODE   DECIMAL CODE   WORD   SPEECH SEGMENT
00 01      1              AT     at//
00 02      2              ALL    0*1//
00 03      3              CALL   keys
...
00 06      6              MR.    mi^ ter//
...
03 E8      1,000          THE    tae-//
...
0D AC      3,500          YOUR   y c/Hll
TABLE 3

HEX CODE   DECIMAL CODE   SPEECH SEGMENT
00 01      1              a-
...
00 03      3              a
...
00 0A      10             b !
00 0D      13             d
...
00 14      20             du
00 32      50             //e-
...
00 3F      63             f
00 4B      75             //i
...
01 F4      500            I
01 F5      501            lu//
02 08      520            o*r
02 12      530            Ml
02 42      578            taeV/
02 B5      693            s
03 58      856            u
04 3F      1,087          y

APPENDIX

**** Special symbols ****

#   One or more vowels [AEIOUY]
+   One of E, I, Y: a front vowel
:   Zero or more consonants [BCDFGHJKLMNPQRSTVWXZ]
^   One consonant
.   One of B, V, D, G, J, L, M, N, R, W, Z: a voiced consonant
%   One of ER, E, ES, ED, ING, ELY: a suffix
&   One of S, C, G, Z, X, J, CH, SH: a sibilant
@   One of T, S, R, D, L, Z, N, J, TH, CH, SH: a consonant influencing following u

**** A rules ****

1 / /A/ / = 'UH'
2 / /ARE/ / = 'AH-R'
3 / /AR/O/ = 'UH-R'
4 //AR/#/ = 'EE-R'
5 / ^/AS/#/ = 'EY-S'
6 //A/WA/ = 'UH'
7 //AW// = 'AW'
8 / :/ANY// = 'EH-N-EE'
9 //A/^+#/ = 'EY'
10 /#:/ALLY/ / = 'UH-L-EE'
11 / /AL/#/ = 'UH-L'
12 //AGAIN// = 'UH-G-EH-N'
13 /#:/AG/E/ = 'IH-J'
14 //A/^+:#/ = 'AE'
15 / :/A/^+/ = 'EY'
16 / /ARR// = 'UH-R'
17 //ARR// = 'AE-R'
18 / :/AR// = 'AH-R'
19 //AR/ / = 'ER'
20 //AR// = 'AE-R'
21 //AIR// = 'EH-R'
22 //AI// = 'EY'
23 //AY// = 'EY'
24 //AU// = 'AW'
25 /#:/AL/ / = 'UH-L'
26 /#:/ALS/ / = 'UH-L-Z'
27 //ALK// = 'AW-K'
28 //AL/^/ = 'AW-L'
29 / :/ABLE// = 'EY-B-UH-L'
30 //ABLE// = 'UH-B-UH-L'
31 //ANG/+/ = 'EY-N-J'
32 / C/ATEE/ / = 'AE-TH-ZE'
33 //A// = 'AE'
**** B rules ****

1 / /BE/^#/ = 'B-IH'
2 //BEING// = 'B-EE-IH-NG'
3 / /BOTH/ / = 'B-OH-TH'
4 / /BUS/#/ = 'B-IH-Z'
5 //BUIL// = 'B-IH-L'
6 //B// = 'B'

**** C rules ****

1 / /CH/^/ = 'K'
2 /^E/CH// = 'K'
3 //CH// = 'CH'
4 / S/CI/#/ = 'S-I'
5 //CI/A/ = 'SH'
6 //CI/O/ = 'SH'
7 //CI/EN/ = 'SH'
8 //C/+/ = 'S'
9 //CK// = 'K'
10 //COM/%/ = 'K-AH-M'
11 //C// = 'K'

**** D rules ****

1 /#:/DED/ / = 'D-IH-D'
2 /.E/D/ / = 'D'
3 /#^:E/D/ / = 'T'
4 / /DE/^#/ = 'D-IH'
5 / /DO/ / = 'D-OO'
6 / /DOES// = 'D-UH-Z'
7 / /DOING// = 'D-OO-IH-NG'
8 / /DOW// = 'D-OW'
9 //DU/A/ = 'J-OO'
10 //D// = 'D'
**** E rules ****



**** H rules **** 



1 
2 
3 
4 
5 
6 
7 
3 
9 
10 
11 
12 
13 
14 
15 
15 
17 
IS 
13 
20 
21 
22 
23 
24 
2 5 
25 
27 
23 
29 
2 0 
3 1 
32 



36 
37 



/ r 

s ' / 
'EE' 



#s/E/ / 3 
'-:/£/./ 

:/E/ / a 
#/XD/ / - 'D' 
#:/E/D / = " 
/SV/ER/ » 'EK-V 
EL/ EVEN/ / = 'EK-V-EH-N' 
S/EVEN// = 'EH-V-EH-N' 
/£/**/ « 'EE' 
/E/PH%/ = 1 EE' 
/ERI/#/ » 'EE-R-EE' 
/ J- /EH/// = 'EE' 
/ER/#/ = ' EH-R' 
/ER// « 'ER' 

/EVEN// = ' EE-V-EH-N 7 
*:/E/W/ " 
g/EW// - '00' 
/EW// = 'Y-OO' 
/E/O/ = ' EE' 
fil/*S/ / « 'IH-Z' 
#:/E/S / - " 
^ : /ELY/ / = ' L-EX ' 
/s'/EMEKT// - 'M-EH-N-1' 
/EPUL// = 'F-U-L'. 
/EE// = ' EE 7 
/EARN// = ' ER-N' 

/EAR/-/ - ' ER' 
/ EAD / / = ' EH-D ' 
#:/EA/ / » ' EE-UK' 
/EA/SC/ = 
/EA// = 
/EIGH// 

'EE' 
= 'I' 
'EE' 
'Y-OO' 



- 'Efi' 
' EE' 



/EX// - 
/EYE// 
/EY// - 
/EU// = 



/E// = 'EE' 



**** F rules ****

1 //FUL// = 'F-U-L'
2 //F// = 'F'

**** G rules ****

1 //GIV// = 'G-IH-V'
2 / /G/I^/ = 'G'
3 //GE/T/ = 'G-EH'
4 /SU/GGES// = 'G-J-EH-SS'
5 //GG// = 'G'
6 / B#/G// = 'G'
7 //G/+/ = 'J'
8 //GREAT// =
9 /#/GH// = ''
10 //G// = 'G'



**** H rules ****

1 / /HAV// = 'H-AE-V'
2 / /HERE// = 'H-EE-R'
3 / /HOUR// = 'OW-ER'
4 //HOW// = 'H-OW'
5 //H/#/ = 'H'
6 //H// = ''



**** I rules ****

1 / /IN// = 'IH-N'
2 / /I/ / = 'I'
3 //IN/D/ = 'I-N'
4 //IER// = 'EE-ER'
5 /#:R/IED// = 'EE-D'
6 //IED/ / = 'I-D'
7 //IEN// = 'EE-EH-N'
8 //IE/T/ = 'I-EH'
9 / :/I/%/ = 'I'
10 //I/%/ = 'EE'
11 //IE// = 'EE'
12 /N/INE// = 'I-N'
13 /T/IME// = 'I-M'
14 //I/^+:#/ = 'IH'
15 //IR/#/ = 'I-R'
16 //IS/%/ = 'I-S'
17 //IX/%/ = 'IH-K-S'
18 //IZ/%/ = 'I-Z'
19 //I/D%/ = 'I'
20 /+^/I/^+/ = 'IH'
21 //I/T%/ = 'I'
22 /#^:/I/^+/ = 'IH'
23 //I/^+/ = 'I'
24 //IR// = 'ER'
25 //IGH// = 'I'
26 //ILD// = 'I-L-D'
27 //IGN/ / = 'I-N'
28 //IGN/^/ = 'I-N'
29 //IGN/%/ = 'I-N'
30 //IQUE// = 'EE-K'
31 //I// = 'IH'



**** J rules ****

1 //J// = 'J'

**** K rules ****

1 / /K/N/ = ''

**** L rules ****

1 //LO/C#/ = 'L-OH'
2 /L/L// = ''



//1EAU// = 'L-E--0' 



4 //lEAU/ 

5 //L// = 'L'
**** M rules ****

1 //MOV// = 'M-OO-V'
2 //M// = 'M'

**** N rules ****

1 /E/NG/+/ = 'N-J'
2 //NG/R/ = 'NG'
3 //NG/#/ = 'NG'
4 //NGL/%/ = 'NG-UH-L'
5 //NG/ / = 'NG'
6 //NK// = 'N-K'
7 / /NOW/ / = 'N-OW'
8 //N// = 'N'



**** O rules ****

1 //OF/ / = 'UH-V'
2 //OROUGH// = 'ER-OH'
3 / F/OR/TY/ = 'OH-R'
4 /#:/OR/ / = 'ER'
5 /#:/ORS/ / = 'ER-Z'
6 //OR// = 'AW-R'
7 / /ONE// = 'W-UH-N'
8 //OW/EL/ = 'OW'
9 //OW// = 'OH'
10 / /OVER// = 'OH-V-ER'
11 //OV// = 'UH-V'
12 //O/^%/ = 'OH'
13 //O/^EN/ = 'OH'
14 //O/^I#/ = 'OH'
15 //OL/D/ = 'OH-L'
16 //OUGHT// = 'AH-T'
17 //OUGH// = 'UH-F'
18 /^/OU/^L/ = 'UH'
19 / /OU// =
20 /H/OU/S#/ = 'OW'
21 //OUS// = 'UH-S'
22 / F/OUR// = 'OH-R'
23 //OUR// = 'AW-R'
24 //OULD// = 'U-D'
25 //OUP// = 'OO-P'
26 //OU// =
27 //OY// = 'AW-EE'
28 //OING// = 'OH-IH-NG'
29 //OI// = 'AW-EE'
30 //OOR// = 'OH-R'
31 //OOK// = 'U-K'
32 //OOD// = 'U-D'
33 //OO// = 'OO'
34 //O/E/ = 'OH'
35 //O/ / = 'OH'
36 //OA// = 'OH'
37 / /ONLY// = 'OH-N-L-EE'
38 / /ONCE// = 'W-UH-N-S'
39 //ON'T// = 'OH-N-T'
40 /C/O/N/ = 'AH'
41 //O/NG/ = 'AH'
42 /^:/O/N/ = 'UH'
43 /I/ON// = 'UH-N'
44 /#:/ON/ / = 'UH-N'
45 /#^/ON// = 'UH-N'
46 //O/ST / = 'OH'
47 //OF/^/ = 'AW-F'
48 //OTHER// = 'UH-TH-ER'
49 //OSS/ / = 'AW-S'
50 /#^:/OM/ / = 'UH-M'
51 //O// = 'AH'

**** P rules ****

1 //PH// = 'F'
2 //PEOP// = 'P-EE-P'
3 //POW// = 'P-OW'
4 //PUT/ / = 'P-U-T'
5 //P// = 'P'

**** Q rules ****

1 //QUAR// = 'K-W-AW-R'
2 //QU// =
3 //Q// = 'K'

**** R rules ****

1 / /RE/^#/ = 'R-EE'
2 //R// = 'R'
**** S rules ****

1 //SH// = 'SH'
2 /#/SION// = 'ZH-UH-N'
3 //SOME// = 'S-AH-M'
4 /#/SUR/#/ = 'ZH-ER'
5 //SUR/#/ = 'SH-ER'
6 /#/SU/#/ = 'ZH-OO'
7 /#/SSU/#/ = 'SH-OO'
8 /#/SED/ / = 'Z-D'
9 /#/S/#/ = 'Z'
10 //SAID// = 'S-EH-D'
11 /^/SION// = 'SH-UH-N'
12 //S/S/ = ''
13 /./S/ / = 'Z'
14 /#:.E/S/ / = 'Z'
15 /#^:##/S/ / = 'Z'
16 /#^:#/S/ / = 'S'
17 /U/S/ / = 'S'
18 / :#/S/ / = 'Z'
19 / /SCH// = 'S-K'
20 //S/C+/ = ''
21 /#/SM// = 'Z-M'
22 /#/SN/'/ = 'Z-UH-N'
23 //S// = 'S'



**** T rules ****

1 / /THE/ / = 'TH-UH'
2 / /TO/ / = 'T-OO'
3 / /THAT// = 'TH-AE-T'
4 / /THIS/ / = 'TH-IH-S'
5 / /THEY// = 'TH-EY'
6 / /THERE// = 'TH-EH-R'
7 //THER// = 'TH-ER'
8 / /THEIR/ / = 'TH-EH-R'
9 / /THAN/ / = 'TH-AE-N'
10 / /THEM/ / = 'TH-EH-M'
11 //THESE/ / = 'TH-EE-Z'
12 / /THEN// = 'TH-EH-N'
13 //THROUGH// = 'TH-R-OO'
14 //THOSE// = 'TH-OH-Z'
15 //THOUGH/ / = 'TH-OH'
16 / /THUS// = 'TH-UH-S'
17 //TH// = 'TH'
18 /#:/TED/ / = 'T-IH-D'
19 /S/TI/#N/ = 'CH'
20 //TI/O/ = 'SH'
21 //TI/A/ = 'T'
22 //TIEN// = 'SH-UH-N'
23 //TUR/#/ = 'CH-ER'
24 //TU/A/ = 'CH-OO'
25 / /TWO// = 'T-OO'
26 //T// = 'T'



**** U rules ****

1 / /UN/I/ = 'Y-OO-N'
2 / /UN// = 'UH-N'
3 / /UPON// = 'UH-P-AW-N'
4 /B/UR/#/ = 'ER'
5 //UR/#/ = 'Y-ER'
6 //UR// = 'ER'

7 / * ,UH ' 
a //u/"/ - ' CJK' 
S //UY// = 'I' 

10 / G/U/?7 = " 

11 /G/U/V = " 

12 /G/U/?/ - 7 W 7 

13 I inn It - '¥-oo' 

14 /a/ci// » 'oo' 

15 /e/u// « 'co' 

15 //LV/ = 'WC 



**** V rules ****

1 //VIEW// = 'V-Y-OO'
2 //V// = 'V'

**** W rules ****

1 / /WERE// = 'W-ER'
2 //WA/S/ = 'W-AH'
3 //WA/T/ = 'W-AH'
4 / /WHERE/ / = 'WH-EH-R'
5 //WHAT// = 'WH-AH-T'
6 //WHOL// = 'H-OH-L'
7 //WHO// = 'H-OO'
8 //WH// = 'WH'
9 //WAR// = 'W-AH-R'
10 //WOR// = 'W-ER'
11 //WR// = 'R'
12 //W// = 'W'



**** X rules ****

1 //X// = 'K-S'

**** Y rules ****

1 //YOUNG// = 'Y-UH-NG'
2 / /YOU// = 'Y-OO'
3 / /YES// = 'Y-EH-S'
4 / /Y// = 'Y'
5 /#^:/Y/ / = 'EE'
6 /#^:/Y/I/ = 'EE'
7 / :/Y/ / = 'I'
8 / :/Y/#/ = 'I'
9 / :/Y/^+:#/ = 'IH'
10 / :/Y/^#/ = 'I'
11 //Y// = 'IH'
**** Z rules ****

1 //ZZ// = 'T-Z'
2 //Z// = 'Z'

**** Misc. rules ****

1 //'// = ''
2 // // = ' '
3 //,// = ' '
4 //;// = ' '
5 //:// = ' '


6 





7 //I// = ' ' 

s inn = ' ' 
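The rule tables above can be read as entries of the form /left context/letters/right context/ = 'phonemes'. The following is a minimal sketch of how such a table might be applied, under several assumptions that are not confirmed by the tables themselves: rules for a letter are tried in listed order and the first match wins, a space in a context stands for a word boundary, and the wildcard symbols (#, ^, +, :, %) are not modelled at all. The `RULES` subset and the function name `to_phonemes` are illustrative only.

```python
# Illustrative subset of the tables, stored as (left, letters, right, phonemes).
# Assumption: a space in a context means a word boundary; wildcards are omitted.
RULES = {
    "D": [("", "D", "", "D")],
    "S": [("", "SAID", "", "S-EH-D"), ("", "S", "", "S")],
    "V": [("", "VIEW", "", "V-Y-OO"), ("", "V", "", "V")],
    "W": [(" ", "WHERE", " ", "WH-EH-R"), ("", "W", "", "W")],
    "Z": [("", "ZZ", "", "T-Z"), ("", "Z", "", "Z")],
}


def to_phonemes(word, rules=RULES):
    """Convert one word to a phoneme string using first-match-wins rules."""
    text = " " + word.upper() + " "  # pad so boundary contexts can match
    out = []
    i = 1
    while i < len(text) - 1:
        for left, letters, right, phonemes in rules.get(text[i], []):
            end = i + len(letters)
            if (text.startswith(letters, i)
                    and text[:i].endswith(left)
                    and text.startswith(right, end)):
                if phonemes:  # an empty output marks a silent letter group
                    out.append(phonemes)
                i = end
                break
        else:
            raise ValueError(f"no rule matches at {text[i:].strip()!r}")
    return "-".join(out)


print(to_phonemes("view"))
print(to_phonemes("where"))
print(to_phonemes("dvd"))
```

Placing the longer, more specific entries before the single-letter default in each group is what lets "VIEW" be consumed as a unit instead of letter by letter.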

Claims

1. A mobile data terminal for receiving and converting transmission data indicative of text data strings into an audible
signal, comprising:

a receiver for receiving said transmission data through a communication network;

a first memory for storing said received transmission data;

a processor for obtaining from said stored transmission data speech segments corresponding to said text data
strings; and

an audio generator for generating audible signals representative of said speech segments.

2. The mobile data terminal according to Claim 1, wherein said transmission data includes data indicative of words.

3. The mobile data terminal according to Claim 2, wherein said transmission data includes data indicative of speech
segments.

4. The mobile data terminal according to any one of the preceding claims, wherein said transmission data includes
data indicative of text segments and wherein said data indicative of text segments are associated with at least one
of syllables and words.

5. The mobile data terminal according to any one of the preceding claims, wherein said communication network is
associated with at least one of Cellular Digital Packet Data (CDPD), satellite, Mobitex data, Ardis data, Specialized
Mobile Radio, GSM and PCS.

6. The mobile data terminal according to any one of the preceding claims, wherein said transmission data is coded,
and wherein said mobile data terminal further includes a decoder, and wherein said coded transmission data is
decoded in said mobile data terminal.

7. The mobile data terminal according to Claim 6, wherein said words are classified into at least two groups, a first
group including a priori known words, constituting first group words, and a second group including new words,
constituting second group words, and wherein the transmission data with respect to each one of the first group
words is a code representative of speech segments that correspond to a complete first group word, and the
transmission data with respect to each one of the second group words is at least one code, the or each code being
representative of a respective speech segment that corresponds to at least a portion of said second group word.
8. The mobile data terminal according to Claim 7, wherein speech segments that correspond to a first group word
are extracted from said coded transmission data and said mobile data terminal further includes a feeder for feeding
said extracted speech segments to said audio generator for generating audio signals representative of said speech
segments.

9. The mobile data terminal according to Claim 7, wherein speech segments that correspond to a second group word
are extracted from said coded transmission data and said mobile data terminal further includes a feeder for feeding
said extracted speech segments to said audio generator for generating audio signals representative of said speech
segments.

10. The mobile data terminal according to any one of the preceding claims, wherein said audible signals are associated
with at least one of synthesized speech signals and speech signals.

11. The mobile data terminal according to any one of the preceding claims, wherein said transmission data is
compressed and wherein said mobile data terminal further comprises a decompressor for decompressing said
compressed transmission data.

12. The mobile data terminal according to any one of the preceding claims, wherein said mobile data terminal further
comprises:

a transmitter for transmitting transmission data;

a coder for coding transmission data; and

a compressor for compressing transmission data.

13. A method for transmitting, receiving and converting transmission data indicative of text data strings originated by
or received from a first device, into audible signals indicative of said data text strings in at least one second device;
either or both of said first device and the at least one second device being a mobile data terminal mountable on a
mobile platform; at least one of said first device and said second device includes a table of speech segments
corresponding each to a word or a portion thereof; the method comprising the following steps:

(i) reducing said text data strings to words;

(ii) associating with said words corresponding speech segments;

(iii) transmitting through a communication network from said first device to the at least one second device said
transmission data;

(iv) receiving, through said communication network, in the or each one of said second devices said transmission
data;

(v) generating in said at least one second device audio signals representative of said speech segments.

14. The method according to Claim 13, wherein said step (i) is executed in said first device.

15. The method according to Claim 14, wherein said step (ii) is executed in said first device.

16. The method according to Claim 14, wherein said step (ii) is executed in said at least one second device.

17. The method according to Claim 13, wherein said steps (i) and (ii) are executed in said at least one second device.

18. The method according to Claim 13, wherein said transmission data includes data indicative of words.

19. The method according to any one of Claims 13 and 17, wherein said transmission data includes data indicative of
speech segments.

20. The method according to any one of Claims 13, 18 and 19, wherein said transmission data includes data indicative
of text segments and wherein said data indicative of text segments are associated with at least one of syllables
and words.

21. The method according to any one of Claims 13 to 20, wherein said communication network is associated with at
least one of Cellular Digital Packet Data (CDPD), satellite, Mobitex data, Ardis data, Specialized Mobile Radio,
GSM and PCS.
22. The method according to any one of Claims 13 to 21, wherein said transmission data is coded, and wherein said
coded transmission data is decoded in said at least one second device.

23. The method according to Claim 22, wherein said words are classified into at least two groups, a first group including
a priori known words, constituting first group words, and a second group including new words, constituting
second group words, and wherein the transmission data with respect to each one of the first group words is a code
representative of speech segments that correspond to a complete first group word, and the transmission data with
respect to each one of the second group words is at least one code representative of a respective speech segment
that corresponds to at least a portion of said second group word.

24. The method according to Claim 23, wherein speech segments that correspond to a first group word are extracted
from said coded transmission data and said mobile data terminal further includes a feeder for feeding said extracted
speech segments to said audio generator for generating audio signals representative of said speech segments.

25. The method according to Claim 23, wherein speech segments that correspond to a second group word are
extracted from said coded transmission data and said mobile data terminal further includes a feeder for feeding said
extracted speech segments to said audio generator for generating audio signals representative of said speech
segments.

26. The method according to any one of Claims 13 to 25, wherein said audible signals are associated with at least
one of synthesized speech signals and speech signals.
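
The two-group coding recited in Claims 7 and 23, together with steps (i) to (v) of Claim 13, can be pictured with a small sketch. Everything concrete in it is an assumption made for illustration: the table contents, the "W"/"S" tags used to tell a whole-word code from a list of segment codes, the choice to spell a new word out letter by letter, and the function names `encode` and `decode`. The claims do not prescribe any of these details.

```python
# Hypothetical tables standing in for the claimed "table of speech segments".
WORD_CODES = {"GO": 1, "TO": 2}                   # first group: a priori known words
WORD_SEGMENTS = {1: ["G", "OH"], 2: ["T", "OO"]}  # word code -> segments of the whole word
SEGMENT_CODES = {"D": 10, "V": 11}                # one code per individual speech segment
SEGMENT_NAMES = {code: seg for seg, code in SEGMENT_CODES.items()}


def encode(text):
    """First device: reduce the text to words and code each word."""
    coded = []
    for word in text.upper().split():
        if word in WORD_CODES:
            # Known word: a single code stands for the complete word.
            coded.append(("W", WORD_CODES[word]))
        else:
            # New word (assumption): one segment code per letter.
            # A letter missing from SEGMENT_CODES raises KeyError.
            coded.append(("S", [SEGMENT_CODES[letter] for letter in word]))
    return coded


def decode(coded):
    """Second device: recover the speech segments for the audio generator."""
    segments = []
    for kind, payload in coded:
        if kind == "W":
            segments.extend(WORD_SEGMENTS[payload])
        else:
            segments.extend(SEGMENT_NAMES[code] for code in payload)
    return segments


message = encode("go to dvd")  # steps (i) and (ii), executed in the first device
print(message)                 # what would be transmitted in steps (iii) and (iv)
print(decode(message))         # segments handed to the audio generator in step (v)
```

Known words cost one code each on the air, while a new word costs one code per segment, which is the trade-off the first-group and second-group distinction appears to be aimed at.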